Assessment and comparing of support vector machines model and regression equations for predicting alluvial channel geometry
نویسندگان
چکیده
Determine the stable channel geometry of the river is one of the most important topics in river engineering. Various relationships (based on statistical and theoretical methods) to predict the stable channels dimensions are expressed by many scientists. In this study, three Support Vector Machines (SVM) models are designed to predict width (w), depth (h) and slope (s) of stable channel. 85 cross-section river field data is used in training and testing models. The models input parameters are the flow discharge (Q), median sediment diameter (d50) and affecting Shields parameter (τ ). Furthermore, the width, depth and slope values are calculated by Afzalimehr regression relationship. Several statistical indexes are used to check the accuracy of the models in comparison with field data. Results show that SVM models with correlation coefficient (R) 0.86, 0.66 and 0.646 in width, depth and slope prediction respectively have a good agreement with observational data. Also, the models comparison show a considerably better performance of the SVM models over the available regressions equations with a mean absolute relative error (MARE) decreasing of 72%, 20% and 11% in width, depth and slope prediction, respectively. The presented methodology in this paper is a good approach in predicting cross section geometry of alluvial rivers also it can be used to design stable irrigation and water conveyance channels. Keywords; SVM model, stable channel geometry, regression equation, observed data. Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 58 Introduction Hydraulic geometry determination of a stable channel is studied by many scientists in the last hundred years. The most important geometric parameters of stable channel are apex width, average depth and longitudinal slope. Various relationships to predict the stable channels dimensions are expressed by many scientists. Hey et al. (1986) through multiple regressions method obtained width, depth and channel slope and concluded that in addition to the effective role of flow discharge, plant coverage of channel walls affect the channel width equation and the bed grain size affects the depth and channel slope equation as well [8]. Afzalimehr et al. (2010) using non-linear regression analysis obtained the width, depth and slope of the stable channel and concluded that the average grain size (d50) and Shields parameter (τ ) are not considered as effective variables in the channel width and depth prediction and only depends on the flow discharge [1]. Lee and Julien (2006) [12] achieved the hydraulic dimension of alluvial channels by nonlinear regression and their results were compared with the Julien and Wargadalam (2006) results which was based on empirical methods and concluded that the regression equations results are in more compatible with experimental data [10]. In recent decades, soft computing methods to study the flow pattern of hydraulic structures, estimate and predict the complex phenomena in hydrology and hydraulics are used (e.g. in: curved channel (Gholami et al. 2014 [5]; Gholami et al. 2015a [6]; Gholami et al. 2015b [7]; Karimi et al. 2015 [11]); Sediment transportation (Ebtehaj & Bonakdari, 2013 and 2014a [3], [4]); Discharge capacity (Karimi et al. 2015)). Khadangi et al. (2009) used Multi-Layer Perceptron Neural Network (MLPNN) for modeling the alluvial channels hydraulic geometry and their results were compared with semi-empirical equations [15]. Tahershamsi et al. (2012) using MLPNN model predicted the stable channel width and compared their results with regression equations. Among all computational intelligence methods, the SVM model is a novel algorithm from the machine learning community [13]. Javadi et al. (2015) estimated river bed form dimension using Artificial Neural Network (ANN) and SVM models, their result show that SVM model had a higher capability for estimating and simulating height of the bed form than ANN model [9]. Beechie and Imaki (2015) Predict natural channel patterns in stable state based on landscape and geomorphic controls by SVM models [2]. Their models show good accuracy when consider all affected parameter such as discharge, valley confinement, sediment supply, and sediment caliber. In the present research, SVM model has been used to predict the width, depth and slope of stable channel. Afzalimehr’s field data in 85 cross section rivers are used for training and testing models [1]. The Afzalimehr’s regression equations in width, depth and slope prediction are used to evaluate efficiency of models [1]. Different statistical indexes are applied for evaluation of present models. Materials and methods Overview of SVM model Support Vector Machine (SVM) was first introduced based on Vapnik theory and used for regression by Vapnik et al. (1997) [14]. SVM is based on the construction of the Lagrange multipliers equation. For a certain data set: ( ) { } i i i d x G , = Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 59 SVM estimated the following function: ( ) ( ) b x w x f + = φ (1) Where di is the desired value, xi is the input vector, n is the data patterns number φ(x) is the high dimensional feature space mapping nonlinearly from input space (x) and the w and b are calculated using regularizing risk function defied as follows: The second term of Equation (2) ( ) 2 5 . 0 w × known as regularization term, C is the reqularized constant determining the trade-off between regularization term and empirical risk. Increasing the C, leads to decreasing the relative importance of regularization term in comparison to empirical risk. The ε variable is equal to estimation precision on training dataset. By introducing the positive slack variables ζi and ζi and transformed the Equation (2) in the primal function as Equation (3), w and b is estimated. By exploiting the optimally constraints and introducing Lagrange multipliers, the decision function (Equation (1)) in the explicit form is presented as follows: where K(x,xi)=φ(xi)φ(xj) and the term K(x,xi) is called the kernel function, which is product of the two inner vector xi and xj in the feature space φ(xi) and φ(xj), respectively. All calculating related φ(x) is performed using kernel function. The used kernel function in this study is Radial basis function (RBF) kernel function which is calculated as follows: Where, γ is the kernel parameter which should be carefully determined to reach the acceptance results. Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 60 Data presentation for stable channel design and experimental design The channel hydraulic parameters at stable channel include water level width (w), average flow depth (h) and channel slope (s). The study of previous researches show that two flow discharge (Q) and average particle size of sediment (d50) parameters and later Shields parameter (τ) (sediment particle moving) are important parameter to predict w, h and s at stable channel in most studies. This parameter is defined as follows: Where ρ , the mass density of water (ρ =1000 kg/m ), ρs mass density of sediment (ρs=2650 kg/m ), g the gravitational force (g= 9.81 m/s), ds the particle diameter or that relative density and ν kinematics viscosity ( ν= 1×10-6 m/s), and τ the shear stress that shear stress model with zero-pressure gradient is estimated as follows: In the present paper, three different SVM models are presented to predict width, depth and slope of stable channel cross sections. The input variables are Q, d50 and τ * parameters. The observational data related to 85 cross section river with gravel-bed in IRAN (Afzalimehr et al. 2010 [1]) that are located in stability regime condition, are used to SVM models train and test. Then Afzalimehr’ s regression relationships [1] are used to predict geometry of stable channel and also for evaluating of SVM model’ s efficiency, These equations are extended based on statistical calculations on a Afzalimehr’s dataset which these equations are as follow [1]: Statistical Analysis The performance of SVM models is evaluated with the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Relative Error (MARE), Correlation coefficient (R) and BIAS statistical parameters. Equations 13, 14, 15, 16 and 17 are used to compute RMSE, MAE, MARE, R, and BIAS respectively: Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 61 Where Yi(Observed) is the output observational parameter, Yi(Model) is the parameter predicted by the SVM models, (Model) is the mean SVM model parameter and N is the number of parameters. RMSE and MAE show the difference between the modeled and observed data in the same unit of them. Higher model accuracy leads to RMSE and MAE values closer to zero. R provides a measure of how well the observed outcomes are replicated by the model. Results and Discussion Geometric and hydraulic parameters of stable channels are predicted using SVM models in the present study. 85 cross section rivers field data of Afzalimehr et al. (2010) are utilized for training and testing models which 60 data (70% of whole data) and 25 data (30% of whole data) are used for training and testing models, respectively [1]. Figure 1 shows the regression plot to predict the width, depth and slope by SVM models in comparison with observed data. Table 2 shows the R, MARE, RMSE, MAE, BIAS error indexes between SVM models and observed data. Also, the width, depth and slope values are estimated by Afzalimehr et al. (2009)’s equations (based on statistical regression methods [1]) and are compared with observed values and this equation’s error indexes are shown in Table 1. In Figure 1, data compression has been more around exact line and also R value (in Table 1) in three models shows high accuracy of SVM models in prediction (R= 0.86, 0.66 and 0.646 in width, depth and slope prediction models, respectively). In all figures, data distribution is around the ±10% error line that shows the range of data errors. In Table 1, in width prediction model, SVM model with high R value is more accurately than Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 62 the Afzalimehr’ s equation (R= 0.860). Also in this model, relative and absolute error of MARE and RMSE have the less value in SVM model (MARE =0.062 and RMSE =0.20). In depth prediction model, SVM model with high correlation coefficient and low error values is more accurately than Afzalimehr’ s equation (R= 0.66, MARE= 0.042, RMSE= 0.202). In slope prediction model like width and depth prediction models, relative and absolute error values in SVM model are more less than Afzalimehr’s equation (MARE= 0.078, RMSE= 0.597). in general, MARE value in SVM model are more less than Afzalimehr’ s regression equations by 72%, 20% and 11% in prediction width, depth and slope parameters, respectively. Therefore, regression models performance improve using SVM models. Performance improvement of SVM model in width prediction is more than another models because in the Afzalimehr’ s width equation are existed only discharge parameter but input variables in SVM model are all three Q, d50, τ* parameter. The lowest efficiency improvement is slope prediction model because the presence of all three affected parameters in both models. Specifically, presence of τ parameter in slope prediction according on shear stress relation with zero-pressure gradient is more affected. BIAS indices show the underestimation and overestimation of models. This index value in SVM model Afzalimehr et al. (2009) equations [1] to predict the width, depth and slope is close to zero. It can be said that, this index with the same average value predict the estimation value as underestimation and overestimation. Negative and positive value indices show the underestimation and overestimation model, respectively. In present SVM and regression models, in both width and slope models, BIAS values are positive, indicating that the models are as overestimate. And negative values indicate an underestimation in predicting the depth models. Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 63 In Figure 2, width, depth and slope values predicted by SVM model are compared with observed data. SVM model predicts well velocity data trend like observed data and have a good agreement with observed data. In all three figures, SVM model predicts higher and lower values than those observed data in maximum and minimum points, respectively (except of 15-25 data range). Mesopotamia Environmental Journal ISSN 2410-2598 Mesop. environ. j. 2016, Vol.2, No.3:57-66. www.bumej.com 64 Fig .2: Comparison of width, depth and slope predicted by SVM model with observed values.
منابع مشابه
An ANFIS-based Approach for Predicting the Manning Roughness Coefficient in Alluvial Channels at the Bank-full Stage
An intelligent method based on adaptive neuro-fuzzy inference system (ANFIS) for identifying Manning’s roughness coefficients in modeling of alluvial river is presented. The procedure for selecting values of Manning n is subjective and requires judgment and skill which are developed primarily through experience. During practical applications, researchers often find that a correct choice of the ...
متن کاملA Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels
The limiting velocity in open channels to prevent long-term sedimentation is predicted in this paper using a powerful soft computing technique known as Extreme Learning Machines (ELM). The ELM is a single Layer Feed-forward Neural Network (SLFNN) with a high level of training speed. The dimensionless parameter of limiting velocity which is known as the densimetric Froude number (Fr) is predicte...
متن کاملToward a Thorough Approach to Predicting Klinkenberg Permeability in a Tight Gas Reservoir: A Comparative Study
Klinkenberg permeability is an important parameter in tight gas reservoirs. There are conventional methods for determining it, but these methods depend on core permeability. Cores are few in number, but well logs are usually accessible for all wells and provide continuous information. In this regard, regression methods have been used to achieve reliable relations between log readings and Klinke...
متن کاملApplication of Gene Expression Programming and Support Vector Regression models to Modeling and Prediction Monthly precipitation
Estimating and predicting precipitation and achieving its runoff play an important role to correct management and exploitation of basins, management of dams and reservoirs, minimizing the flood damages and droughts, and water resource management, so they are considered by hydrologists. The appropriate performance of intelligent models leads researchers to use them for predicting hydrological ph...
متن کاملPredicting tensile strength of rocks from physical properties based on support vector regression optimized by cultural algorithm
The tensile strength (TS) of rocks is an important parameter in the design of a variety of engineering structures such as the surface and underground mines, dam foundations, types of tunnels and excavations, and oil wells. In addition, the physical properties of a rock are intrinsic characteristics, which influence its mechanical behavior at a fundamental level. In this paper, a new approach co...
متن کاملEvaluation of the Efficiency of Linear and Nonlinear Models in Predicting Monthly Rainfall (Case Study: Hamedan Province)
In this research, we used the support vector machine (SVM), support vector machine combine with wavelet transform (W-SVM), ARMAX and ARIMA models to predict the monthly values of precipitation. The study considers monthly time series data for precipitation stations located in Hamedan province during a 25-year period (1998-2016). The 25-year simulation period was divided into 17 years for t...
متن کامل